Abstract

Here we prove the convergence of the Ando-Li-Mathias and Bini-Meini-Poloni
procedures for matrix means. More precisely, we prove that both procedures
converge for any two-variable function which maps pairs of positive definite
matrices to a positive definite matrix and is not greater than the square mean
of the two matrices. In order to set up the Bini-Meini-Poloni procedure, a
weighted two-variable matrix mean is also needed. Therefore we give a
definition of a two-variable weighted matrix mean corresponding to every
symmetric matrix mean. We also show that most of the properties considered by
Ando, Li and Mathias for the n-variable geometric mean hold for all of the
n-variable maps obtained by these two limiting processes from any two-variable
matrix mean. As a consequence, the Bini-Meini-Poloni procedure converges
cubically for every matrix mean.

Keywords: means, operator means, geometric mean 
PACS: 15A24, 15A45, 26E60, 47A64 

1. Introduction 

Recently several researchers have considered the extension of means to
multiple variables. Most of the contributions aimed at the geometric mean
of positive definite matrices. Every extension is based on the two-variable
form of the geometric mean defined by the theory of Kubo and Ando, which
characterizes a wide family of two-variable matrix means. These two-variable
means are isomorphic to normalized operator monotone functions, therefore
they are well characterized by the theory.

The geometric mean is of great interest, since it is less trivial than the
arithmetic or harmonic mean of positive matrices and at the same time it
also has a corresponding nonpositively curved Riemannian metric for which
it is the midpoint operation. This metric provides a great tool to extend
the two-variable geometric mean of matrices to several variables. For
instance, with the aid of this metric structure, Ando-Li-Mathias,
Bini-Meini-Poloni, Jung-Lee-Yamazaki and Moakher were able
to define the geometric mean of several positive operators. The first three
constructions are based on different iterative procedures which define the
mean as a limit point for n ≥ 3 variables. The geometric n-mean of Moakher,
the so-called Riemannian mean, is defined as the center of mass of the n
matrices.

All of the procedures extend to metric spaces where the two-mean fulfills
some geometrical properties, or the space itself has some additional
geometrical properties. For example, Lawson and Lim were able to extend
the Ando-Li-Mathias procedure to metric spaces where the two-mean fulfills
a geometric property expressed by an inequality closely related to
Busemann nonpositive curvature. The Ando-Li-Mathias procedure was also
considered by Petz and Temesi as an extension of matrix means to several
variables; however, the convergence of the procedure was proved there only
for orderable tuples.

Recently the author defined means of several matrices based on an
iterative procedure relying on a properly chosen infinite sequence of graphs
G. This procedure extends every two-variable matrix mean given by the
Kubo-Ando theory to several variables. The applicability of the extension
has been proved in full generality. This procedure was later extended to
geodesic metric spaces or more specifically C^-domains where the two-mean
is considered as the midpoint map of the space [10]. In this article we prove
that the Ando-Li-Mathias (ALM) and Bini-Meini-Poloni (BMP) procedures
converge for any matrix mean. Since the BMP-procedure requires the
existence of a weighted matrix mean, we give a suitable definition of
a two-variable weighted matrix mean which agrees with the theory of
Kubo and Ando.

2. Matrix means in two variables



In this section we recall the Kubo-Ando theory of matrix means. We
investigate some properties of these means that will be useful in the following
sections. Following the foundational article of Kubo and Ando, we consider
two-variable functions on the open convex cone P(r) of hermitian
positive definite r × r matrices.

Definition 1 (Matrix mean). Let $M : P(r) \times P(r) \to P(r)$. Then $M(A,B)$
is called a matrix mean if the following conditions hold:

(i) $M(I, I) = I$, where $I$ denotes the identity,

(ii) if $A \le A'$ and $B \le B'$, then $M(A,B) \le M(A',B')$,

(iii) $C M(A,B) C \le M(CAC, CBC)$ for all hermitian $C$,

(iv) if $A_n \downarrow A$ and $B_n \downarrow B$, then $M(A_n,B_n) \downarrow M(A,B)$.

As consequences of the above properties we can derive further properties of
matrix means:

(v) if $A \le B$ then $A \le M(A,B) \le B$,

(vi) $S M(A,B) S^* \le M(SAS^*, SBS^*)$ for $S$ not necessarily hermitian,

(vii) $M(CAC^*, CBC^*) = C M(A,B) C^*$ if $C$ is invertible,

(viii) $M(A,B) + M(C,D) \le M(A + C, B + D)$,

and the most important consequence of the Kubo-Ando theory of means is
property

(ix) $M(A,B) = A^{1/2} f\left(A^{-1/2} B A^{-1/2}\right) A^{1/2}$,

where $f$ is a non-negative operator monotone function which is normalized
so that $f(I) = I$. The last property provides an isomorphism between means of
two hermitian positive definite matrices and normalized operator monotone
functions. As an immediate consequence, every matrix mean
$M(A,B)$ is continuous. The proof of properties (vi)-(ix) can be found in
the foundational article cited above. Property (v) is a direct consequence of
properties (i), (ii) and (ix), since if $A \le B$ then

\[ A = M(A,A) \le M(A,B) \le M(B,B) = B. \tag{1} \]

Kubo-Ando theory considers not necessarily symmetric matrix means, so
generally $M(A,B) \ne M(B,A)$. However, it turns out, according to [5], that
if $M(A,B)$ is symmetric then we have

\[ \left(\frac{A^{-1} + B^{-1}}{2}\right)^{-1} \le M(A,B) \le \frac{A + B}{2}. \tag{2} \]

In other words, the smallest symmetric mean is the harmonic mean and the
largest is the arithmetic mean with respect to the standard positive definite
order. This fact is a consequence of the isomorphism between normalized
operator monotone functions and matrix means.
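
The representation in property (ix) and the bounds in (2) can be checked numerically. The following is a minimal sketch, assuming real symmetric positive definite inputs; the helper names `sym_fun`, `kubo_ando_mean`, `loewner_leq` and `random_pd` are illustrative and not part of the original text.

```python
import numpy as np

def sym_fun(S, func):
    """Apply a scalar function to a symmetric matrix via its eigendecomposition."""
    w, V = np.linalg.eigh(S)
    return (V * func(w)) @ V.T

def kubo_ando_mean(A, B, f):
    """M(A,B) = A^{1/2} f(A^{-1/2} B A^{-1/2}) A^{1/2}, as in property (ix)."""
    A_half = sym_fun(A, np.sqrt)
    A_inv_half = sym_fun(A, lambda w: 1.0 / np.sqrt(w))
    X = A_inv_half @ B @ A_inv_half
    X = (X + X.T) / 2  # remove rounding asymmetry before eigh
    return A_half @ sym_fun(X, f) @ A_half

def loewner_leq(X, Y, tol=1e-9):
    """True if Y - X is positive semidefinite up to a small tolerance."""
    D = Y - X
    return np.linalg.eigvalsh((D + D.T) / 2).min() >= -tol

rng = np.random.default_rng(0)

def random_pd(r):
    """Random well-conditioned symmetric positive definite matrix."""
    G = rng.standard_normal((r, r))
    return G @ G.T + np.eye(r)

A, B = random_pd(3), random_pd(3)
arithmetic = kubo_ando_mean(A, B, lambda x: (1 + x) / 2)    # f(x) = (1+x)/2
geometric = kubo_ando_mean(A, B, np.sqrt)                   # f(x) = sqrt(x)
harmonic = kubo_ando_mean(A, B, lambda x: 2 * x / (1 + x))  # f(x) = 2x/(1+x)
```

Here the three representing functions are the standard ones for the arithmetic, geometric and harmonic means; the geometric mean then sits between the two extremal symmetric means, as (2) predicts.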

3. Weighted matrix means in two variables

Although Kubo-Ando theory characterizes all possible matrix means, it
does not clarify how weighted means correspond to symmetric ones. In some
special cases we have a natural correspondence between a symmetric mean and
some other weighted means. In most cases this correspondence is based on
some affine geometric structure. For instance, the weighted arithmetic mean

\[ A_t(A,B) = (1-t)A + tB \tag{3} \]

is the geodesic line connecting $A$ and $B$ with respect to the standard
Euclidean metric on the space of square complex matrices. This
metric has the form $\langle X, Y \rangle_p = \mathrm{Tr}\{XY^*\}$ on each tangent space. Another
related example is the weighted harmonic mean

\[ H_t(A,B) = \left((1-t)A^{-1} + tB^{-1}\right)^{-1} \tag{4} \]

which is also a geodesic, with respect to the metric $\langle X, Y \rangle_p = \mathrm{Tr}\{p^{-2} X p^{-2} Y\}$,
where $p \in P(r)$ and $X, Y$ are hermitian.

There is another important example, which is the weighted geometric
mean

\[ G_t(A,B) = A^{1/2} \left(A^{-1/2} B A^{-1/2}\right)^{t} A^{1/2}. \tag{5} \]

The corresponding Riemannian metric is $\langle X, Y \rangle_p = \mathrm{Tr}\{p^{-1} X p^{-1} Y\}$, where
again $p \in P(r)$ and $X, Y$ are hermitian. This manifold is nonpositively
curved, while the other two are Euclidean and therefore have zero curvature.

In the above three cases the structure of the underlying affinely connected
manifold gives us the natural correspondence between the symmetric and
weighted counterparts of the matrix mean. In these cases the geodesic lines
$\gamma : [0,1] \to P(r)$ provide the weighted matrix means. We are going to
use the following definition to identify these special means.

Definition 2 (Affine mean). An affine mean $M : W \times W \to W$ is a geodesic
midpoint operation $M(A,B) = \exp_A\left(\tfrac{1}{2} \log_A(B)\right)$ on a smooth manifold
$W$ equipped with an affine connection, where $B$ is assumed to be within the
injectivity radius of the exponential map $\exp_A(x)$ of the connection given at
the point $A$. The mapping $\log_A(x)$ is just the inverse of the exponential map
at the point $A \in W$.

In full generality it is not known whether such an affine manifold exists for
every symmetric matrix mean, with the mean as the midpoint operation
on the manifold. This appears to be a delicate problem; however, we can
borrow some ideas from the above three examples to define weighted
mean counterparts for every symmetric matrix mean.

First of all we define a procedure for every symmetric matrix mean
$M(A,B)$ and for all $t \in [0,1]$. Our procedure is based on the fact
that every $t \in [0,1]$ can be approximated by dyadic rationals, since dyadic
rationals are dense in $[0,1]$.

Definition 3 (Weighted mean process). Let $M(\cdot,\cdot)$ be a symmetric
matrix mean, $A, B \in P(r)$ and $t \in [0,1]$. Let $a_0 = 0$ and $b_0 = 1$, $A_0 = A$ and
$B_0 = B$. Define $a_n, b_n$ and $A_n, B_n$ recursively by the following procedure for
all $n = 0, 1, 2, \dots$:

    if $a_n = t$ then
        $a_{n+1} = a_n$ and $b_{n+1} = a_n$,
        $A_{n+1} = A_n$ and $B_{n+1} = A_n$
    else if $b_n = t$ then
        $a_{n+1} = b_n$ and $b_{n+1} = b_n$,
        $A_{n+1} = B_n$ and $B_{n+1} = B_n$
    else if $\frac{a_n + b_n}{2} \le t$ then
        $a_{n+1} = \frac{a_n + b_n}{2}$ and $b_{n+1} = b_n$,
        $A_{n+1} = M(A_n, B_n)$ and $B_{n+1} = B_n$
    else
        $b_{n+1} = \frac{a_n + b_n}{2}$ and $a_{n+1} = a_n$,
        $B_{n+1} = M(A_n, B_n)$ and $A_{n+1} = A_n$
    end if

In this way $a_{n+1}, b_{n+1}$ and $A_{n+1}, B_{n+1}$ are defined recursively
in terms of $a_n, b_n$ and $A_n, B_n$.

This algorithm may also be regarded as a kind of binary search with the
recurrence relation:

    if $t = \frac{t_1 + t_2}{2}$ then
        $M_t(A,B) = M\left(M_{t_1}(A,B), M_{t_2}(A,B)\right)$
    end if
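
The weighted mean process above can be sketched in a few lines. This is a minimal sketch, assuming real symmetric positive definite inputs and floating point weights; the function names and the fixed step count are illustrative choices, not part of the original definition.

```python
import numpy as np

def sym_fun(S, func):
    """Apply a scalar function to a symmetric matrix via its eigendecomposition."""
    w, V = np.linalg.eigh(S)
    return (V * func(w)) @ V.T

def geometric_mean(A, B):
    """Symmetric geometric mean A^{1/2} (A^{-1/2} B A^{-1/2})^{1/2} A^{1/2}."""
    A_half = sym_fun(A, np.sqrt)
    A_inv_half = sym_fun(A, lambda w: 1.0 / np.sqrt(w))
    X = A_inv_half @ B @ A_inv_half
    return A_half @ sym_fun((X + X.T) / 2, np.sqrt) @ A_half

def arithmetic_mean(A, B):
    return (A + B) / 2

def weighted_mean(M, A, B, t, steps=60):
    """Binary search on the weight t, replacing one endpoint by M(A_n, B_n) per step."""
    a, b = 0.0, 1.0
    for _ in range(steps):
        if a == t:          # first branch: the process has stopped at A_n
            return A
        if b == t:          # second branch: the process has stopped at B_n
            return B
        mid = (a + b) / 2
        if mid <= t:        # third branch: move the left endpoint
            a, A = mid, M(A, B)
        else:               # fourth branch: move the right endpoint
            b, B = mid, M(A, B)
    return A                # A_n and B_n agree up to roughly 2**(-steps)

rng = np.random.default_rng(1)
G1, G2 = rng.standard_normal((3, 3)), rng.standard_normal((3, 3))
A, B = G1 @ G1.T + np.eye(3), G2 @ G2.T + np.eye(3)
```

For a dyadic weight such as t = 1/4 the loop stops after finitely many steps, while for other weights the step count simply truncates the limit process.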

Theorem 1. The sequences $A_n$ and $B_n$ given in Definition 3 are convergent
and have the same limit point.
Proof. If $t = m 2^{-k}$ for some integers $m$ and $k$ then there is
nothing to prove, since the procedure terminates after finitely many steps. So
suppose that $t$ is not a dyadic rational. We will make use of the following
multiplicative metric on $P(r)$ [1]:

\[ R(A,B) = \max\{\rho(A^{-1}B), \rho(B^{-1}A)\} \tag{6} \]

for all $A, B \in P(r)$, where $\rho(A)$ denotes the spectral radius of $A$. The above
metric has the following properties:

1. $R(A,B) \ge 1$,

2. $R(A,B) = 1$ if and only if $A = B$,

3. $R(A,C) \le R(A,B) R(B,C)$,

4. $R(A,B)^{-1} A \le B \le R(A,B) A$,

5. $\|A - B\| \le (R(A,B) - 1) \|A\|$.

Since $R(A,B) = R(I, A^{-1/2} B A^{-1/2})$ we have

\[ R(A, M(A,B)) = R\left(I, f\left(A^{-1/2} B A^{-1/2}\right)\right), \tag{7} \]

where $f(t)$ is the corresponding normalized operator monotone function. Now
since $M(A,B)$ is symmetric, (2) holds. From this, for the corresponding
normalized operator monotone function $f(t)$ we have

\[ 2\left(I + X^{-1}\right)^{-1} \le f(X) \le \frac{I + X}{2}. \tag{8} \]

This also yields that $R(I, f(X)) \le \max\left\{\rho\left(\frac{I+X}{2}\right), \rho\left(\frac{I+X^{-1}}{2}\right)\right\} = \frac{1 + R(I,X)}{2}$ for
every $X \in P(r)$, so

\[ R(A, M(A,B)) \le \frac{1 + R(A,B)}{2}. \tag{9} \]

By the above inequality we can easily conclude the following for the sequences
$A_n$, $B_n$:

\[
\begin{aligned}
R(A_{n+1}, B_{n+1}) &\le \frac{1 + R(A_n, B_n)}{2} = 1 + \frac{1}{2}\left[R(A_n, B_n) - 1\right], \\
R(A_n, B_n) &\le 1 + \frac{1}{2^n}\left[R(A_0, B_0) - 1\right], \\
R(A_n, A_{n+1}) &\le 1 + \frac{1}{2^n}\left[R(A_0, B_0) - 1\right].
\end{aligned}
\tag{10}
\]

There exists $K \in P(r)$ such that $A \le K$, $B \le K$, and by property (ii) of
matrix means $A_n \le K$ and $B_n \le K$, so by property 5 of $R(\cdot,\cdot)$

\[
\begin{aligned}
\|A_{n+1} - A_n\| &\le \left(R(A_{n+1}, A_n) - 1\right) \|K\|, \\
\|A_{n+1} - A_n\| &\le \frac{1}{2^n}\left[R(A_0, B_0) - 1\right] \|K\|, \\
\sum_{n=0}^{\infty} \|A_{n+1} - A_n\| &\le \sum_{n=0}^{\infty} \frac{1}{2^n}\left[R(A_0, B_0) - 1\right] \|K\| = 2\left[R(A_0, B_0) - 1\right] \|K\|.
\end{aligned}
\tag{11}
\]

This means that $A_n$ is a Cauchy sequence, therefore convergent, and by the
above we also have $\|A_n - B_n\| \to 0$, so both $A_n$ and $B_n$ converge to the
same limit point. □
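
The contraction estimate behind this proof can be observed numerically. The sketch below assumes real symmetric positive definite matrices and uses the arithmetic and harmonic means as sample symmetric means; the function names and the weight t = 1/3 are illustrative assumptions.

```python
import numpy as np

def spectral_radius(X):
    return np.abs(np.linalg.eigvals(X)).max()

def R(A, B):
    """Multiplicative metric R(A,B) = max(rho(A^{-1}B), rho(B^{-1}A))."""
    return max(spectral_radius(np.linalg.solve(A, B)),
               spectral_radius(np.linalg.solve(B, A)))

def arithmetic_mean(A, B):
    return (A + B) / 2

def harmonic_mean(A, B):
    return 2 * np.linalg.inv(np.linalg.inv(A) + np.linalg.inv(B))

rng = np.random.default_rng(2)
G1, G2 = rng.standard_normal((3, 3)), rng.standard_normal((3, 3))
A, B = G1 @ G1.T + np.eye(3), G2 @ G2.T + np.eye(3)

# Track R(A_n, B_n) along the weighted mean process for a non-dyadic weight.
t, a, b = 1.0 / 3.0, 0.0, 1.0
An, Bn, gaps = A, B, []
for n in range(20):
    gaps.append(R(An, Bn))
    mid = (a + b) / 2
    if mid <= t:
        a, An = mid, harmonic_mean(An, Bn)
    else:
        b, Bn = mid, harmonic_mean(An, Bn)
```

The recorded values in `gaps` should decay towards 1 at least as fast as the second line of the estimate for the sequences predicts.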

We will base our weighted mean on the above theorem. 

Definition 4 (Weighted mean). The common limit point of $A_n$ and $B_n$ in
Theorem 1 will be denoted by $M_t(A,B)$ and from now on in the article it
is considered as the weighted mean corresponding to the symmetric matrix
mean $M(\cdot,\cdot)$.

What are the properties of this weighted mean? First of all it is not hard to
prove the following.

Proposition 2. $M_t(A,B)$ yields the correct corresponding weighted means
in the case of the arithmetic, geometric and harmonic means.

The above is a consequence of the affine geodesy of the corresponding
manifolds mentioned above. There are further important properties which are
fulfilled by $M_t(A,B)$:

Proposition 3. $M_t(A,B)$ for $A, B \in P(r)$ and $t \in [0,1]$ fulfills the following
properties:

(i') $M_t(I, I) = I$,

(ii') if $A \le A'$ and $B \le B'$, then $M_t(A,B) \le M_t(A',B')$,

(iii') $C M_t(A,B) C \le M_t(CAC, CBC)$,

(iv') if $A_n \downarrow A$ and $B_n \downarrow B$ then $M_t(A_n, B_n) \downarrow M_t(A,B)$,

(v') if $N(A,B) \le M(A,B)$ then $N_t(A,B) \le M_t(A,B)$,

(vi') $M_{1/2}(A,B) = M(A,B)$,

(vii') $M_t(A,B)$ is continuous in $t$.

Proof. Properties (i') and (ii') are trivial consequences of the corresponding
properties of symmetric matrix means.

For property (iii') consider $A' = CAC$ and $B' = CBC$ and start the
procedure in the definition of $M_t(\cdot,\cdot)$ for the pairs $A, B$ and $A', B'$. Then
we have $C A_1 C = C M(A_0, B_0) C \le M(C A_0 C, C B_0 C) = A'_1$ if $t \ge 1/2$, or
$C B_1 C = C M(A_0, B_0) C \le M(C A_0 C, C B_0 C) = B'_1$ otherwise. Now for every $n$ we
use property (ii) for symmetric matrix means, so we have $C A_n C \le A'_n$ and
$C B_n C \le B'_n$ for every $n \ge 1$. Taking the limits we conclude the assertion of
property (iii').

What immediately follows from this property is that $M_t(CXC^*, CYC^*) =
C M_t(X, Y) C^*$ for all invertible $C$. We will use this to show that $M_t(A,B)$ is
continuous in $A, B$, so by property (ii') and the continuity we get property
(iv') as a consequence. Actually we prove more, namely the following lemma.

Lemma 4. Let $F : P(r)^n \to P(r)$ satisfy the properties

1. if $X_i \le X'_i$ for all $i$, then $F(X_1, \dots, X_n) \le F(X'_1, \dots, X'_n)$,

2. $F(cX_1, \dots, cX_n) = cF(X_1, \dots, X_n)$ for real $c > 0$.

Then $F$ is continuous.

Proof. For a function $f : Y_1 \to Y_2$ between two metric spaces
$(Y_1, d_1)$ and $(Y_2, d_2)$, sequential continuity and the usual topological
continuity are equivalent; this is a standard fact. We will show that
sequential continuity holds, which gives the desired result.

We extend the metric $R(\cdot,\cdot)$ to $P(r)^n$ as follows. Let $X = (X_1, \dots, X_n) \in
P(r)^n$ and $Y = (Y_1, \dots, Y_n) \in P(r)^n$, then we define

\[ R_n(X, Y) = \max_{1 \le i \le n} \{R(X_i, Y_i)\}. \tag{12} \]

Now suppose we have a convergent sequence of tuples $X^k = (X_1^k, \dots, X_n^k) \in
P(r)^n$ for which $(X_1^k, \dots, X_n^k) \to (X_1, \dots, X_n) = X \in P(r)^n$. Using property
4 of $R(\cdot,\cdot)$ we have the following inequalities:

\[ R_n(X^k, X)^{-1} X_i^k \le X_i \le R_n(X^k, X) X_i^k. \tag{13} \]

Now applying the monotonicity property 1 of $F$, we have with the notation
$c_k = R_n(X^k, X)$ the following:

\[ F\left(c_k^{-1} X_1^k, \dots, c_k^{-1} X_n^k\right) \le F(X_1, \dots, X_n) \le F\left(c_k X_1^k, \dots, c_k X_n^k\right). \tag{14} \]

Using property 2 we conclude that

\[ c_k^{-1} F\left(X_1^k, \dots, X_n^k\right) \le F(X_1, \dots, X_n) \le c_k F\left(X_1^k, \dots, X_n^k\right). \tag{15} \]

Taking the limit $k \to \infty$ we have $c_k \to 1$. This shows that

\[ \lim_{k \to \infty} F\left(X_1^k, \dots, X_n^k\right) = F\left(\lim_{k \to \infty} X_1^k, \dots, \lim_{k \to \infty} X_n^k\right) \tag{16} \]

which is sequential continuity, so the assertion of the lemma follows.

Since the properties needed in the above lemma hold for $M_t(A,B)$, we
conclude that property (iv') must also hold. At this point we
already have by the Kubo-Ando theory of matrix means that $M_t(A,B)$ is a
matrix mean as well, so it fulfills the additional properties (v)-(ix).
Consequently it has a representation with a normalized operator monotone
function.

Property (v') is an easy consequence of repeated use of property (ii)
for matrix means for every $n$. Property (vi') is also trivial.

To prove property (vii') we have to do a bit more work. We have to
show that if $|t_1 - t_2| \to 0$ then also $\|M_{t_1}(A,B) - M_{t_2}(A,B)\| \to 0$. Suppose
$t_1 < t_2$ and take the smallest $j$ for which we have $t_1 \le m 2^{-j} \le t_2$ for some
$m$. Let us set up the iterative procedure in Definition 3 on $A, B$ with $t_1$ and
$t_2$ respectively. Let us denote the resulting matrix sequences in the case of $t_1$
by $A_i^{t_1}, B_i^{t_1}$ and in the case of $t_2$ by $A_i^{t_2}, B_i^{t_2}$, and similarly the numbers
by $a_i^{t_1}, b_i^{t_1}$ and $a_i^{t_2}, b_i^{t_2}$. Notice that the iterative procedure in the $j$th step
for $t_1$ will yield $b_j^{t_1} = m 2^{-j}$ and similarly $a_j^{t_2} = m 2^{-j}$ in the $j$th step for $t_2$.
Suppose $t_1 \ne m 2^{-j}$. Then there exists $i > j$ such that $a_i^{t_1} < t_1 < b_i^{t_1}$ but
$(a_i^{t_1} + b_i^{t_1})/2 > t_1$; this means that $b_{i+1}^{t_1} \ne b_j^{t_1}$. If $t_1 = m 2^{-j}$ then we have
$a_p^{t_1} = b_p^{t_1} = t_1$ for $p > j$ and in this case we define $i := +\infty$. Similarly either
there exists a smallest $l > j$ such that $a_{l+1}^{t_2} \ne a_l^{t_2}$, or we have $t_2 = m 2^{-j}$ so
$a_p^{t_2} = b_p^{t_2} = t_2$ for $p > j$, and then again we define $l := +\infty$. Notice that $i$
and $l$ cannot be infinite at the same time, so we define $k := \min\{i, l\}$. It is
easy to see that as $t_1 \to t_2$, $k \to \infty$. We also have that $B_k^{t_1} = A_k^{t_2}$, so we
can bound the distance of the limit points $M_{t_1}(A,B)$ and $M_{t_2}(A,B)$ from
$B_k^{t_1} = A_k^{t_2}$ as follows:

\[
\begin{aligned}
\left\| B_k^{t_1} - \lim_{i \to \infty} B_i^{t_1} \right\| &\le \sum_{i=k}^{\infty} \left\| B_{i+1}^{t_1} - B_i^{t_1} \right\| \\
&\le \sum_{i=k}^{\infty} \frac{1}{2^i}\left[R(A_0, B_0) - 1\right] \|K\| \\
&= \frac{1}{2^{k-1}}\left[R(A_0, B_0) - 1\right] \|K\|.
\end{aligned}
\tag{17}
\]

We also have the same bound for $\left\| A_k^{t_2} - \lim_{i \to \infty} A_i^{t_2} \right\|$. Since $B_k^{t_1} = A_k^{t_2}$, we
have

\[
\begin{aligned}
\left\| M_{t_1}(A_0, B_0) - M_{t_2}(A_0, B_0) \right\| &\le \left\| M_{t_1}(A_0, B_0) - B_k^{t_1} \right\| + \left\| M_{t_2}(A_0, B_0) - A_k^{t_2} \right\| \\
&\le \frac{1}{2^{k-2}}\left[R(A_0, B_0) - 1\right] \|K\|.
\end{aligned}
\tag{18}
\]

Since $k \to \infty$ as $t_1 \to t_2$, by the above $\|M_{t_1}(A,B) - M_{t_2}(A,B)\| \to 0$. □

By the above proposition, $M_t(A,B)$ is a continuous function
of $t$. So $M_t(A,B)$ is a one-parameter family of matrix means corresponding
to every symmetric matrix mean. Since every matrix mean, by virtue of
property (ix), is representable by a normalized operator monotone function
$f(x)$, we may represent such a one-parameter family of matrix means by a
one-parameter family of normalized operator monotone functions $f_t(x)$, $t \in [0,1]$.
In other words we have the following.

Corollary 5. For every symmetric matrix mean $M(A,B)$ there is a
corresponding one-parameter family of weighted means $M_t(A,B)$ for $t \in [0,1]$.
Let $f(x)$ be the normalized operator monotone function corresponding to
$M(A,B)$. Then similarly we have a one-parameter family of normalized
operator monotone functions $f_t(x)$ corresponding to $M_t(A,B)$. The family
$f_t(x)$ is continuous in $t$, and $f_0(x) = 1$ and $f_1(x) = x$ are the two extremal
points, so $f_t(x)$ interpolates between these two points.

Based on this phenomenon we can conclude the following.

Proposition 6. Let $M(A,B)$ be a symmetric matrix mean. Then

\[ \left((1-t)A^{-1} + tB^{-1}\right)^{-1} \le M_t(A,B) \le (1-t)A + tB, \tag{19} \]

where $M_t(A,B)$ is the weighted version of $M(A,B)$.

Proof. For every symmetric matrix mean $M(A,B)$ we have by (2) that

\[ \left(\frac{A^{-1} + B^{-1}}{2}\right)^{-1} \le M(A,B) \le \frac{A + B}{2}. \tag{20} \]

Now what follows from Proposition 2 is that the harmonic mean on the left
hand side above has the weighted harmonic mean $H_t(A,B)$ defined in (4)
as its weighted counterpart, and similarly we have the weighted arithmetic
mean $A_t(A,B)$ defined in (3) as the weighted counterpart of the arithmetic
mean on the right hand side above. Thus by property (v') in Proposition 3
and the above inequality we have

\[ H_t(A,B) \le M_t(A,B) \le A_t(A,B). \tag{21} \]

□

We are going to use the above definition Mt{A, B) of a weighted matrix 
mean to set up the Bini-Meini-Poloni procedure for every symmetric matrix 
mean, but before we do that we turn to the Ando-Li-Mathias procedure in 
the following section and prove its convergence for every symmetric matrix 
mean. 



4. Ando-Li-Mathias procedure for every matrix mean 

In this section we will prove the convergence of the Ando-Li-Mathias
procedure for every possible symmetric matrix mean. In order to do that we
will generalize the argument given in [9] by applying induction. First of all
let us recall the Ando-Li-Mathias procedure [1]:

Definition 5 (ALM iteration). Let $X^0 = (X_1^0, \dots, X_n^0)$ where $X_i^0 \in P(r)$
and define the mapping $M(X_1, \dots, X_n)$ inductively as follows. If $n = 2$
assume that $M(X_1, X_2)$ is already given. For general $n > 2$ assume that
$M(X_1, \dots, X_{n-1})$ is already defined. Then using $M(X_1, \dots, X_{n-1})$, set up
the iteration

\[ X_i^{l+1} = M\left(Z_{\ne i}\left(X_1^l, \dots, X_n^l\right)\right), \tag{22} \]

where $Z_{\ne i}(X_1^l, \dots, X_n^l) = X_1^l, \dots, X_{i-1}^l, X_{i+1}^l, \dots, X_n^l$. If the sequences $X_i^l$
converge to a common limit point for every $i$, then define

\[ \lim_{l \to \infty} X_i^l = M(X_1^0, \dots, X_n^0). \tag{23} \]
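
The ALM iteration can be sketched recursively. This is a minimal sketch, assuming a user-supplied two-variable mean `mean2`; the stopping tolerance, the iteration cap and the function name `alm_mean` are illustrative assumptions rather than part of the definition.

```python
import numpy as np

def alm_mean(mats, mean2, tol=1e-12, max_iter=200):
    """ALM extension of a two-variable mean to len(mats) matrices."""
    mats = list(mats)
    if len(mats) == 2:
        return mean2(mats[0], mats[1])
    for _ in range(max_iter):
        # Replace X_i by the (n-1)-variable mean of all matrices except X_i.
        mats = [alm_mean(mats[:i] + mats[i + 1:], mean2, tol, max_iter)
                for i in range(len(mats))]
        # Stop once all entries of the tuple agree up to the tolerance.
        if max(np.linalg.norm(mats[0] - Y) for Y in mats[1:]) < tol:
            break
    return mats[0]

def arithmetic_mean(A, B):
    return (A + B) / 2

def harmonic_mean(A, B):
    return 2 * np.linalg.inv(np.linalg.inv(A) + np.linalg.inv(B))

rng = np.random.default_rng(3)
mats = []
for _ in range(3):
    G = rng.standard_normal((3, 3))
    mats.append(G @ G.T + np.eye(3))
```

For the arithmetic and harmonic means the limit is known in closed form (the three-variable arithmetic and harmonic means), which gives a simple sanity check of the sketch.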

Theorem 7. Let $F : P(r)^2 \to P(r)$ and suppose that $F(A,B)$ fulfills one of
the inequalities below:

\[ \left(\frac{A^{-1} + B^{-1}}{2}\right)^{-1} \le F(A,B) \le \left[\frac{A^2 + B^2}{2} - \frac{k}{8}(A - B)^2\right]^{1/2} \tag{24} \]

for a $k \in (0, 2]$, or

\[ F(A,B) \le \frac{A + B}{2}. \tag{25} \]

Then in Definition 5, starting with $M(A,B) := F(A,B)$, $M(X_1, \dots, X_n)$
exists for all $n$; in other words, the sequences converge to a common limit
point for all $n$.

Before we prove the above theorem we mention a few remarks and several
lemmas which we will make use of later. First of all, condition (24) might
seem a bit strange at first glance, although it immediately becomes
straightforward if we consider $k = 2$, since in this case the right hand side
becomes the arithmetic mean. If $k = 0$ then the right hand side is the square mean
$\left(\frac{A^2 + B^2}{2}\right)^{1/2}$. This means that the above theorem automatically
covers every symmetric matrix mean due to (2) as a special case.

Now we have to study some properties of the square mean in order to
prepare the necessary steps for the proof of the above theorem. First of all
one should notice that the square mean is an affine mean. The underlying
manifold is a Riemannian manifold whose metric is a pullback of the
Euclidean metric $\langle A, B \rangle_E = \mathrm{Tr}\{A^* B\}$ over the space of square complex
matrices. This Euclidean metric has the corresponding distance function

\[ d_E(A,B)^2 = \langle A - B, A - B \rangle_E = \mathrm{Tr}\{(A - B)^* (A - B)\}. \tag{26} \]

The isometry is $f(x) = x^2$ and it embeds $P(r)$ into $P(r)$. The distance
function of the pullback metric on $P(r)$ is

\[ d_{x^2}(A,B)^2 = \langle f(A) - f(B), f(A) - f(B) \rangle_E = \mathrm{Tr}\{(A^2 - B^2)^* (A^2 - B^2)\}. \tag{27} \]

The geodesics of this metric are of the form

\[ \gamma_{A,B}(t) = f^{-1}\left[(1-t) f(A) + t f(B)\right] = \left[(1-t)A^2 + tB^2\right]^{1/2}. \tag{28} \]

This shows that the square mean is an affine mean, so the weighted mean
process $M_t(A,B)$ for the square mean gives back the corresponding point on
the geodesic above. Furthermore, since the above metric is a pullback of a
Euclidean metric, it is also Euclidean.

Actually the isometry $f(x)$ can be chosen arbitrarily; in particular any
diffeomorphism will suffice. We are going to derive some properties of the
ALM-procedure with $F(A,B) = \left(\frac{A^2 + B^2}{2}\right)^{1/2}$ and $F(A,B) = \left(\frac{A^{-1} + B^{-1}}{2}\right)^{-1}$
on $P(r)$ endowed with the corresponding pullback metrics. We are
going to denote the general pullback of the distance function $d_E(\cdot,\cdot)$ for an
arbitrary $f$ by

\[ d_f(A,B)^2 = \langle f(A) - f(B), f(A) - f(B) \rangle_E = \mathrm{Tr}\left\{[f(A) - f(B)]^* [f(A) - f(B)]\right\}. \tag{29} \]

The metric space $P(r)$ with the distance function (29) is Euclidean, since
its metric is a pullback of the standard Euclidean metric on the space
of complex $r \times r$ matrices. Let $x_i \in P(r)$ for $i \in \{1, \dots, n\}$ and define
$S = \{x_1, \dots, x_n\}$. Then according to [3] the function

\[ b(x) = \sum_{i=1}^{n} d(x, x_i)^2 \tag{30} \]

has a minimum for $d(\cdot,\cdot) = d_f(\cdot,\cdot)$ and this minimal value is attained at
a unique point $\hat{x}$ which is called the Riemann centroid of $S$. Moreover the
centroid can be given explicitly for these metric spaces on $P(r)$.

Proposition 8. The unique minimizer $\hat{x}$ of the function

\[ b(x) = \sum_{i=1}^{n} d(x, x_i)^2 \tag{31} \]

for the distance function (29) is given as

\[ \hat{x} = f^{-1}\left(\frac{\sum_{i=1}^{n} f(x_i)}{n}\right). \tag{32} \]
Proof. Since the corresponding metric is a pullback of the Euclidean metric
over the space of square complex matrices, it is also Euclidean. Using the
isometric embedding $f(x)$, the objective function of the minimization problem
is of the form

\[ \sum_{i=1}^{n} d_f(x, x_i)^2 = \sum_{i=1}^{n} d_E\left(f(x), f(x_i)\right)^2. \tag{33} \]

But since the Riemann centroid of the set $S' = \{f(x_1), \dots, f(x_n)\}$ in the
Euclidean space of square complex matrices is the arithmetic mean of the
points $\{f(x_1), \dots, f(x_n)\}$, therefore

\[ a = \frac{\sum_{i=1}^{n} f(x_i)}{n} \tag{34} \]

minimizes the functional $\sum_{i=1}^{n} d_E\left(x, f(x_i)\right)^2$, so $\hat{x} = f^{-1}(a)$ minimizes
$\sum_{i=1}^{n} d_E\left(f(x), f(x_i)\right)^2$. □

If we perform one ALM-iteration step on $n$ points in the space $P(r)$ with
this centroid map then the iteration leaves the Riemann centroid of the points
invariant.

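Proposition 9. Let $X_i^0 \in P(r)$ for $i = 1, \dots, n$. Then the ALM-procedure
(Definition 5) set up on the matrices $X_i^0$ with the $n - 1$ variable map
$M(X_1, \dots, X_{n-1}) = f^{-1}\left(\frac{\sum_{i=1}^{n-1} f(X_i)}{n-1}\right)$ leaves the Riemann centroid of
the points $X_1^0, \dots, X_n^0$ invariant with respect to the distance function (29).

Proof.

\[
f^{-1}\left(\frac{\sum_{i=1}^{n} f\left(M\left(Z_{\ne i}(X_1^0, \dots, X_n^0)\right)\right)}{n}\right)
= f^{-1}\left(\frac{\sum_{i=1}^{n} \frac{\sum_{j=1, j \ne i}^{n} f(X_j^0)}{n-1}}{n}\right)
= f^{-1}\left(\frac{\sum_{i=1}^{n} f(X_i^0)}{n}\right).
\tag{35}
\]

Similarly we obtain the above equality for every iteration step, so

\[
f^{-1}\left(\frac{\sum_{i=1}^{n} f(X_i^{l+1})}{n}\right)
= f^{-1}\left(\frac{\sum_{i=1}^{n} f(X_i^{l})}{n}\right)
= f^{-1}\left(\frac{\sum_{i=1}^{n} f(X_i^{0})}{n}\right).
\tag{36}
\]

□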
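
The invariance of the centroid under one ALM step can be checked directly for the square mean, i.e. for f(x) = x². The following is a minimal sketch, assuming real symmetric positive definite matrices; the helper names are illustrative.

```python
import numpy as np

def sqrtm_pd(S):
    """Principal square root of a symmetric positive definite matrix."""
    w, V = np.linalg.eigh((S + S.T) / 2)
    return (V * np.sqrt(w)) @ V.T

def square_mean(mats):
    """Pullback mean f^{-1}(average of f(X_i)) with f(x) = x^2."""
    return sqrtm_pd(sum(X @ X for X in mats) / len(mats))

def alm_step(mats, mean):
    """One ALM step: X_i is replaced by the mean of all matrices except X_i."""
    return [mean(mats[:i] + mats[i + 1:]) for i in range(len(mats))]

rng = np.random.default_rng(4)
mats = []
for _ in range(4):
    G = rng.standard_normal((3, 3))
    mats.append(G @ G.T + np.eye(3))

stepped = alm_step(mats, square_mean)
centroid_before = square_mean(mats)
centroid_after = square_mean(stepped)
```

The two centroids should coincide up to rounding, which is the numerical counterpart of the equality chain in the proof of Proposition 9.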



We turn our attention to more general functions than these pullback
means. The following inequalities will turn out to be useful tools later.



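Lemma 10.

\[
\left[(1-t)A^2 + tB^2 - \frac{k_2}{2} t(1-t)(A - B)^2\right]^{1/2}
\le
\left[(1-t)A^2 + tB^2 - \frac{k_1}{2} t(1-t)(A - B)^2\right]^{1/2}
\tag{37}
\]

for any $A, B \in P(r)$ and $t \in [0,1]$ if $k_1 \le k_2$.

Proof.

\[
\begin{aligned}
0 &\le \frac{k_2 - k_1}{2} t(1-t)(A - B)^2, \\
-\frac{k_2}{2} t(1-t)(A - B)^2 &\le -\frac{k_1}{2} t(1-t)(A - B)^2, \\
(1-t)A^2 + tB^2 - \frac{k_2}{2} t(1-t)(A - B)^2 &\le (1-t)A^2 + tB^2 - \frac{k_1}{2} t(1-t)(A - B)^2.
\end{aligned}
\]

Taking the square root of both sides and using the fact that the
square root is operator monotone, we get the inequality of the assertion. □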

Notice that for $k_1 = 0$ and $k_2 = 2$ we get the weighted arithmetic-square
mean inequality

\[ (1-t)A + tB \le \left[(1-t)A^2 + tB^2\right]^{1/2}. \tag{38} \]
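
The role of the parameter k can be made concrete numerically. The sketch below assumes the quadratic expression under the square root has the form (1-t)A² + tB² - (k/2)t(1-t)(A-B)², and checks that k = 2 reproduces the square of the weighted arithmetic mean; the helper names are illustrative.

```python
import numpy as np

def sqrtm_pd(S):
    """Principal square root of a symmetric positive semidefinite matrix."""
    w, V = np.linalg.eigh((S + S.T) / 2)
    return (V * np.sqrt(np.clip(w, 0.0, None))) @ V.T

def quadratic_bound(A, B, t, k):
    """(1-t)A^2 + tB^2 - (k/2) t(1-t) (A-B)^2, the expression under the root."""
    D = A - B
    return (1 - t) * A @ A + t * B @ B - (k / 2) * t * (1 - t) * D @ D

def loewner_leq(X, Y, tol=1e-9):
    """True if Y - X is positive semidefinite up to a small tolerance."""
    D = Y - X
    return np.linalg.eigvalsh((D + D.T) / 2).min() >= -tol

rng = np.random.default_rng(5)
G1, G2 = rng.standard_normal((3, 3)), rng.standard_normal((3, 3))
A, B = G1 @ G1.T + np.eye(3), G2 @ G2.T + np.eye(3)
t = 0.3
weighted_arithmetic = (1 - t) * A + t * B
weighted_square = sqrtm_pd(quadratic_bound(A, B, t, 0.0))
```

With k = 0 the expression gives the weighted square mean, so comparing the two matrices above in the positive definite order illustrates the weighted arithmetic-square mean inequality.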

We will now prove an inequality which plays a fundamental
role in our further investigations. An important part of the proof of the
convergence of the ALM- and BMP-processes relies on this inequality.



Lemma 11. Let $k \in [0, 2]$, $F : P(r)^2 \to P(r)$ and

\[ F(A,B) \le \left[(1-t)A^2 + tB^2 - \frac{k}{2} t(1-t)(A - B)^2\right]^{1/2}. \tag{39} \]

Then with the distance function $d_E(A,B)^2 = \mathrm{Tr}\{(A - B)^*(A - B)\}$,

\[ d_E(0, F(A,B))^2 \le (1-t) d_E(0, A)^2 + t d_E(0, B)^2 - \frac{k}{2} t(1-t) d_E(A,B)^2. \tag{40} \]
Proof. By substitution the assertion has the following form:

\[ \mathrm{Tr}\left\{F(A,B)^2\right\} \le \mathrm{Tr}\left\{(1-t)A^2 + tB^2 - \frac{k}{2} t(1-t)(A - B)^2\right\}. \tag{41} \]

This holds, since for hermitian positive definite $X \le Y$ we have

\[ 0 \le \mathrm{Tr}\left\{Y^2 - X^2\right\} = \mathrm{Tr}\{(Y - X)(Y + X)\}, \tag{42} \]

where the inequality follows because the trace of the product of the two
positive semidefinite matrices $Y - X$ and $Y + X$ is non-negative. By choosing
$X = F(A,B)$ and $Y = \left[(1-t)A^2 + tB^2 - \frac{k}{2} t(1-t)(A - B)^2\right]^{1/2}$ we get the
assertion. □

Notice that the above lemma already holds for every matrix mean
$M(A,B)$ and their weighted counterparts $M_t(A,B)$ by Proposition 6 and (38).
By Lemma 10 we also have a relatively wide family of functions which fulfill
the conditions of the above lemma.

Now we are in a position to prove Theorem 7.

Proof (Theorem 7). The proof will be based on induction on the number
of matrices $n$. We are going to measure the sum of the squared distances of
the matrices $X_i^l$ from the zero matrix with respect to the distance function
(26) with

\[ a_n^l = \sum_{i=1}^{n} d_E(0, X_i^l)^2 = \sum_{i=1}^{n} \mathrm{Tr}\left\{(X_i^l)^2\right\}. \tag{43} \]

We will also measure the sum of the squared distances of the $X_i^l$ from one another.
We will form this sum over all possible pairs of the $X_i^l$ as

\[ e_n^l = \sum_{1 \le i < j \le n} d_E(X_i^l, X_j^l)^2 = \sum_{1 \le i < j \le n} \mathrm{Tr}\left\{(X_i^l - X_j^l)^2\right\}. \tag{44} \]

We are going to denote the common limit point of the sequences $X_i^l$
by $F_n(X_1^0, \dots, X_n^0)$ for $n$. We will need the following lemmas, which will be
proved by induction on the number of matrices $n$ as well, so we have to embed
these lemmas into this proof of Theorem 7. All three lemmas will be proved
by assuming that they hold for $n$ matrices and also that the ALM-procedure
converges to a common limit for $n$ matrices. Making this assumption we show
that the lemmas hold for $n + 1$ and that the ALM-procedure converges to a
common limit for $n + 1$ as well. For the first step of the induction ($n = 3$)
we will prove the lemmas directly. First we are going to treat the case of the
first inequality (24).
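Lemma 12 (Monotone Iteration). In the first case of inequality (24) we
have

\[ \left(\frac{\sum_{i=1}^{n} (X_i^{l+1})^{-1}}{n}\right)^{-1} \ge \left(\frac{\sum_{i=1}^{n} (X_i^{l})^{-1}}{n}\right)^{-1}. \tag{45} \]

In the second case of inequality (25) we have

\[ \frac{\sum_{i=1}^{n} X_i^{l+1}}{n} \le \frac{\sum_{i=1}^{n} X_i^{l}}{n}. \tag{46} \]

Proof. We argue by induction on the number of matrices $n$ by making the
assumption that the ALM-procedure converges for $n \ge 3$ to the common limit
point $F_n(X_1^0, \dots, X_n^0)$, in other words $X_i^l \to F_n(X_1^0, \dots, X_n^0)$ for $n$, and that
the lemma holds for $n$. Consider the first case of inequality (24). Then the
inequality of the lemma for $n$ implies that

\[ \left(\frac{\sum_{i=1}^{n} (X_i^{l})^{-1}}{n}\right)^{-1} \ge \left(\frac{\sum_{i=1}^{n} (X_i^{0})^{-1}}{n}\right)^{-1} \tag{47} \]

and if we take the limit $l \to \infty$ on the left hand side we get the inequality

\[ \lim_{l \to \infty} \left(\frac{\sum_{i=1}^{n} (X_i^{l})^{-1}}{n}\right)^{-1} = F_n(X_1^0, \dots, X_n^0) \ge \left(\frac{\sum_{i=1}^{n} (X_i^{0})^{-1}}{n}\right)^{-1}. \tag{48} \]

The above inequality also holds directly for $n = 2$ by the assumption of
inequality (24), so this will also provide the first step in our induction.
Now we prove the lemma for $n + 1$ if it is true for $n$. By (48) we have

\[ X_i^{l+1} = F_n\left(Z_{\ne i}\left(X_1^l, \dots, X_{n+1}^l\right)\right) \ge \left(\frac{\sum_{j=1, j \ne i}^{n+1} (X_j^{l})^{-1}}{n}\right)^{-1}. \tag{49} \]

The $n + 1$-variable harmonic mean is operator monotone in its variables,
therefore if we take the $n + 1$-variable harmonic mean of the above on the
left and right hand side, we get

\[ \left(\frac{\sum_{i=1}^{n+1} \left[F_n\left(Z_{\ne i}\left(X_1^l, \dots, X_{n+1}^l\right)\right)\right]^{-1}}{n + 1}\right)^{-1} \ge \left(\frac{\sum_{i=1}^{n+1} (H_i^l)^{-1}}{n + 1}\right)^{-1}, \tag{50} \]

where $H_i^l = \left(\frac{\sum_{j=1, j \ne i}^{n+1} (X_j^{l})^{-1}}{n}\right)^{-1}$. Then by Proposition 9 with $f(t) = t^{-1}$, the
harmonic mean of the $n + 1$ matrices is left invariant on the right hand side,
so this is equivalent to

\[ \left(\frac{\sum_{i=1}^{n+1} (X_i^{l+1})^{-1}}{n + 1}\right)^{-1} \ge \left(\frac{\sum_{i=1}^{n+1} (X_i^{l})^{-1}}{n + 1}\right)^{-1}. \tag{51} \]

The second case given by inequality (25) is very similar to the proof of
the first case. Instead of inequality (47) we have

\[ \frac{\sum_{i=1}^{n} X_i^{l}}{n} \le \frac{\sum_{i=1}^{n} X_i^{0}}{n} \tag{52} \]

and instead of (48) we have

\[ \lim_{l \to \infty} \frac{\sum_{i=1}^{n} X_i^{l}}{n} = F_n(X_1^0, \dots, X_n^0) \le \frac{\sum_{i=1}^{n} X_i^{0}}{n}. \tag{53} \]

The rest of the argument is just the same, although we have the $n$-variable
arithmetic mean replacing the $n$-variable harmonic mean, and the inequalities
are reversed. The lemma is proved.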

Lemma 13 (Decreasing Distances). We have

\[ a_n^{l+1} \le a_n^l - \frac{k}{8} z_n e_n^l \tag{54} \]

in the case of (24), or we have (54) with $k = 2$ in the case of (25). In both
cases $z_n = \frac{2}{n-1}$.

Proof. We will prove this for the case (24). The second case (25) is just
the same with $k = 2$: we will only use that the right hand side of (24) holds,
so we do not have to treat the second case (25) separately due to (38) with
$t = 1/2$. The first step is to show the above for $n = 3$. By Lemma 11, and
since the right hand side of (24) is equivalent to the assumption of the lemma
for $t = 1/2$, we have

\[ d_E\left(0, F(X_i^l, X_j^l)\right)^2 \le \frac{d_E(0, X_i^l)^2 + d_E(0, X_j^l)^2}{2} - \frac{k}{8} d_E(X_i^l, X_j^l)^2, \tag{55} \]

where $X_s^{l+1} = F(X_i^l, X_j^l)$, $s \in \{1, 2, 3\}$ and $i \ne j$, $j \ne s$, $s \ne i$. There are 3 distinct inequalities
of the above form for $s = 1, 2, 3$. By summing these inequalities over $s$ we get (54)
for $n = 3$ and $z_3 = 1$.
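
For n = 3 the decrease of the summed squared norms can be observed in a single ALM step. The sketch below assumes the quantities a (sum of Tr X_i²) and e (sum of Tr (X_i - X_j)² over pairs) and the constant k/8 with k = 2; under these assumptions the arithmetic mean attains the bound with equality, while the harmonic mean satisfies it as an inequality. All names are illustrative.

```python
import numpy as np
from itertools import combinations

def a_sum(mats):
    """Sum of squared Euclidean distances of the matrices from zero."""
    return sum(np.trace(X @ X) for X in mats)

def e_sum(mats):
    """Sum of squared Euclidean distances over all pairs of matrices."""
    return sum(np.trace((X - Y) @ (X - Y)) for X, Y in combinations(mats, 2))

def alm_step3(mats, mean2):
    """One ALM step for three matrices: X_s becomes the mean of the other two."""
    X1, X2, X3 = mats
    return [mean2(X2, X3), mean2(X1, X3), mean2(X1, X2)]

def arithmetic_mean(A, B):
    return (A + B) / 2

def harmonic_mean(A, B):
    return 2 * np.linalg.inv(np.linalg.inv(A) + np.linalg.inv(B))

rng = np.random.default_rng(6)
mats = []
for _ in range(3):
    G = rng.standard_normal((3, 3))
    mats.append(G @ G.T + np.eye(3))

bound = a_sum(mats) - (2 / 8) * e_sum(mats)   # k = 2 and z_3 = 1
a_arith = a_sum(alm_step3(mats, arithmetic_mean))
a_harm = a_sum(alm_step3(mats, harmonic_mean))
```

The equality for the arithmetic mean is just the parallelogram identity applied to each of the three pairs.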

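Now suppose (54) holds for $n$ and that the sequences $X_i^l$ converge to a common
limit point for $n$, denoted again by $F_n(X_1^0, \dots, X_n^0)$. Since $a_n^l$ is decreasing
in $l$, we have for every $l \ge 1$

\[ a_n^l \le a_n^0 - \frac{k}{8} z_n e_n^0 \tag{56} \]

and by taking the limit on the left hand side we get

\[
\begin{aligned}
\lim_{l \to \infty} a_n^l = n\, d_E\left(0, F_n(X_1^0, \dots, X_n^0)\right)^2 &\le a_n^0 - \frac{k}{8} z_n e_n^0, \\
d_E\left(0, F_n(X_1^0, \dots, X_n^0)\right)^2 &\le \frac{a_n^0}{n} - \frac{k z_n}{8n} e_n^0.
\end{aligned}
\tag{57}
\]

Then set up the ALM-procedure on $X_i^0 \in P(r)$, $i = 1, 2, \dots, n + 1$, with
$M_n(X_1, \dots, X_n) := F_n(X_1, \dots, X_n)$. Inequality (57) can be applied in any of
the iteration steps, so we get

\[
d_E\left(0, F_n\left(Z_{\ne i}\left(X_1^l, \dots, X_{n+1}^l\right)\right)\right)^2 \le
\frac{\sum_{j=1, j \ne i}^{n+1} d_E(0, X_j^l)^2}{n} - \frac{k z_n}{8n} \sum_{\substack{1 \le j < s \le n+1 \\ j \ne i,\ s \ne i}} d_E(X_j^l, X_s^l)^2.
\tag{58}
\]

If we sum these inequalities over $i$ we arrive at the following:

\[
\sum_{i=1}^{n+1} d_E\left(0, F_n\left(Z_{\ne i}\left(X_1^l, \dots, X_{n+1}^l\right)\right)\right)^2 \le
\sum_{i=1}^{n+1} \frac{\sum_{j=1, j \ne i}^{n+1} d_E(0, X_j^l)^2}{n} - \frac{k z_n}{8n} \sum_{i=1}^{n+1} \sum_{\substack{1 \le j < s \le n+1 \\ j \ne i,\ s \ne i}} d_E(X_j^l, X_s^l)^2.
\tag{59}
\]

The left hand side of the above is just $a_{n+1}^{l+1}$. The first term on the right hand
side is easily written as

\[ \sum_{i=1}^{n+1} \frac{\sum_{j=1, j \ne i}^{n+1} d_E(0, X_j^l)^2}{n} = \sum_{i=1}^{n+1} d_E(0, X_i^l)^2 = a_{n+1}^l. \tag{60} \]

We have to carefully analyze the second term

\[ \sum_{i=1}^{n+1} \sum_{\substack{1 \le j < s \le n+1 \\ j \ne i,\ s \ne i}} d_E(X_j^l, X_s^l)^2. \tag{61} \]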
i=l l<j<s<n+l,j^i,s^i 
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Consider the complete graph Kn+i on n + 1 vertices labelled from 1 to n + 1. 
In this way we have a natural bijective mapping between the matrices 
and the vertices of Kn+i- Then for every squared distance dE{X^,,Xl) 
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we have a corresponding edge in Kn+i of the form (j, s). Then the sum 
J2i<j<s<n+i j^i sy^i^^iXj, Xl)'^ is just the sum of the squared distances cor- 
responding to the edges of the complete graph Kn given on the vertices 
{1, . . . ,i — l,i + 1, . . . ,n + 1}. This is almost X]i<j<s<ra+i 

dE{X',,Xi)\ but 

we leave out from the sum every squared distance corresponding to an edge 
that has the vertex i as an ending vertex. So actually (I6T]) almost equals to 

n+1 

E MXl,Xir = {n + l)eU„ (62) 

*=1 l<j<s<n+l 

but in the sum fl^ every vertex has been left out once, so every squared 
distance corresponding to an edge has been left out twice, hence 

n+1 

J2 E dE{X^,,Xir = {n-l)eU,. (63) 

This shows us that Zn+i = ^^^n and also = 1, so in other words by 
solving the recursion we get 

Zn = (64) 

n — 1 

This concludes the lemma for every n. 
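The edge-counting step and the resulting recursion can be checked numerically. The following is a minimal sketch, not part of the original argument: it uses random vectors in place of matrices (only squared Euclidean distances matter here), and it assumes the recursion $z_{n+1} = \frac{n-1}{n} z_n$ with $z_3 = 1$ as read from the derivation above. The helper names `pair_sum`, `check_counting_identity` and `z_alm` are hypothetical.

```python
from fractions import Fraction
from itertools import combinations

import numpy as np


def pair_sum(points, skip=None):
    """Sum of squared distances over all pairs, optionally leaving out one index."""
    idx = [j for j in range(len(points)) if j != skip]
    return sum(float(np.sum((points[j] - points[s]) ** 2))
               for j, s in combinations(idx, 2))


def check_counting_identity(n, dim=4, seed=0):
    """Return both sides of the identity: sum over i of the pair sum on the
    n points other than i, versus (n - 1) times the full pair sum on n + 1 points."""
    rng = np.random.default_rng(seed)
    pts = [rng.standard_normal(dim) for _ in range(n + 1)]
    lhs = sum(pair_sum(pts, skip=i) for i in range(n + 1))
    rhs = (n - 1) * pair_sum(pts)
    return lhs, rhs


def z_alm(n):
    """Solve z_{m+1} = (m - 1)/m * z_m with z_3 = 1 exactly (assumed recursion)."""
    z = Fraction(1)
    for m in range(3, n):
        z *= Fraction(m - 1, m)
    return z
```

Each pair $(j, s)$ survives in exactly $n - 1$ of the $n + 1$ inner sums, which is what the first function confirms; the second reproduces the closed form $2/(n-1)$ in exact arithmetic.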

Lemma 14 (Boundedness). The matrix sequences $X_i^l$ are bounded for all $n$.

Proof. We have the trivial lower bound $X_i^l > 0$, since by assumption $F(A, B) > 0$, so we have $X_i^l > 0$ for $n = 3$. Now assume again that the ALM-procedure converges to a common limit point denoted by $F_n(X_1^0, \ldots, X_n^0)$ for $n$ and $F_n(X_1^0, \ldots, X_n^0) > 0$. Then trivially for $n + 1$ the sequences $X_i^l > 0$ since $F_n(X_1^0, \ldots, X_n^0) > 0$. This also shows that if the sequences converge for $n + 1$ to a common limit $F_{n+1}(X_1^0, \ldots, X_{n+1}^0)$, then this limit is also bounded from below, so $F_{n+1}(X_1^0, \ldots, X_{n+1}^0) > 0$.

Now we provide a suitable upper bound as well. By the previous assertion, Lemma Decreasing Distances, we have (54) for $n \ge 3$. In particular for $n = 3$ it holds, providing the first step, while for $n > 3$ we need the inductional hypothesis that the ALM-procedure converges to a common limit point $F_{n-1}(X_1^0, \ldots, X_{n-1}^0)$ for $n - 1$. The rest of the argument is just the same for all $n \ge 3$. So by (54) we have $a_n^{l+1} \le a_n^l$, which means that the sequence $a_n^l$ is monotone decreasing. So we have the bound

$$a_n^l \le a_n^0 = b. \tag{65}$$

By the above we get for arbitrary $i$ that

$$d(0, X_i^l)^2 = a_n^l - \sum_{j=1, j \neq i}^{n} d(0, X_j^l)^2 \le a_n^l \le b. \tag{66}$$

This means that the squared norm is bounded from above by $b$ since

$$\|X_i^l\|^2 = \operatorname{Tr}\{(X_i^l)^2\} = d(0, X_i^l)^2. \tag{67}$$

This concludes the proof of the lemma.

Now we move on to the final step of the induction. We prove that for $n = 3$ the ALM-procedure converges and that if it converges for $n$ then it converges for $n + 1$ in both cases of inequalities (24) and (25). This last step will be based on the three lemmas: Lemma Monotone Iteration, Lemma Decreasing Distances and Lemma Boundedness. It is not necessary to prove the $n = 3$ case separately, since these three lemmas hold for $n = 3$ and the argument will be the same as for general $n + 1$, which requires the inductional hypothesis, the convergence of the procedure to a common limit point for $n$.

So by Lemma Decreasing Distances we have

$$a_n^{l+1} \le a_n^l - \frac{k}{8} z_n e_n^l, \tag{68}$$

in other words $a_n^l$ is a decreasing nonnegative sequence in $l$, therefore convergent. Since $z_n > 0$ and has a fixed value for each $n$ by Lemma Decreasing Distances, this means that $e_n^l \to 0$ as $l \to \infty$, so the matrices $X_i^l$ are approaching one another. By Lemma Boundedness we have that these sequences are bounded, hence they have convergent subsequences. But since $e_n^l \to 0$, these subsequences are converging to a common limit point. Let $X_i^{r_l}$ denote a subsequence converging to say $A$ and $X_i^{s_l}$ another subsequence converging to $B$. Without loss of generality we can take $s_l \ge r_l$. By Lemma Monotone Iteration for the case of inequality (24) we have

$$\left(\frac{\sum_{i=1}^n (X_i^{s_l})^{-1}}{n}\right)^{-1} \ge \left(\frac{\sum_{i=1}^n (X_i^{r_l})^{-1}}{n}\right)^{-1}. \tag{69}$$

Now choose a subsequence of subsequences with $s_l \le r_l$, so then again by the lemma

$$\left(\frac{\sum_{i=1}^n (X_i^{s_l})^{-1}}{n}\right)^{-1} \le \left(\frac{\sum_{i=1}^n (X_i^{r_l})^{-1}}{n}\right)^{-1}. \tag{70}$$

Since all $n$ sequences share the same limit along each subsequence, the harmonic means above converge to $B$ and $A$ respectively. Taking the limits we have $B \ge A$ and $B \le A$, so $A = B$. In the second case (25) we have the same argument but using the $n$-variable arithmetic mean instead of the $n$-variable harmonic mean above.

Now this argument shows the convergence of the ALM-procedure to a common limit point directly for $n = 3$ and inductively for $n$ assuming convergence to a common limit for $n - 1$. □

We are going to study some properties of this limit point later, jointly 
with the case of the BMP-mean, after showing that the BMP procedure 
converges. In the next section we will show a similar theorem to Theorem [7] 
for the BMP-procedure. 



5. Bini-Meini-Poloni procedure for every matrix mean 

In this section we will treat the case of the Bini-Meini-Poloni procedure. 
We may do that for matrix means since we have defined a weighted mean 
Mt{A,B) corresponding to any symmetric matrix mean M{A,B). The out- 
line of the proof of the convergence of the BMP-procedure will roughly follow 
the one of the ALM-procedure, although some lemmas will be formulated dif- 
ferently. 

Firstly let us recall the Bini-Meini-Poloni procedure 

Definition 6 (BMP iteration). Let $X = (X_1^0, \ldots, X_n^0)$ where $X_i^0 \in P(r)$ and define the mapping $M(X_1, \ldots, X_n)$ inductively as follows. If $n = 2$ assume that $M_t(X_1, X_2)$ is already given. For general $n > 2$ assume that $M(X_1, \ldots, X_{n-1})$ is already defined. Then using $M(X_1, \ldots, X_{n-1})$, set up the iteration

$$X_i^{l+1} = M_{\frac{n-1}{n}}\left(X_i^l,\, M\left(Z_{\neq i}(X_1^l, \ldots, X_n^l)\right)\right), \tag{71}$$

where $Z_{\neq i}(X_1^l, \ldots, X_n^l) = X_1^l, \ldots, X_{i-1}^l, X_{i+1}^l, \ldots, X_n^l$. If the sequences $X_i^l$ converge to a common limit point for every $i$, then define

$$\lim_{l \to \infty} X_i^l = M(X_1^0, \ldots, X_n^0). \tag{72}$$
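The recursive structure of the BMP iteration can be illustrated with a short numerical sketch. It is only an illustration under stated assumptions: the weighted mean is taken to be the weighted geometric mean $G_t(A, B) = A^{1/2}(A^{-1/2} B A^{-1/2})^t A^{1/2}$, and the names `apply_fn`, `weighted_geometric` and `bmp_mean`, the tolerance and the iteration cap are all hypothetical choices, not part of the definition.

```python
import numpy as np


def apply_fn(A, fn):
    """Apply a scalar function to a symmetric matrix via its eigendecomposition."""
    w, V = np.linalg.eigh(A)
    return (V * fn(w)) @ V.T


def weighted_geometric(A, B, t):
    """G_t(A, B) = A^{1/2} (A^{-1/2} B A^{-1/2})^t A^{1/2}; the weight t sits on B."""
    A_half = apply_fn(A, np.sqrt)
    A_inv_half = apply_fn(A, lambda w: 1.0 / np.sqrt(w))
    C = A_inv_half @ B @ A_inv_half
    C = (C + C.T) / 2  # remove rounding asymmetry before calling eigh
    return A_half @ apply_fn(C, lambda w: w ** t) @ A_half


def bmp_mean(mats, weighted_mean=weighted_geometric, tol=1e-12, max_iter=100):
    """BMP limit point for n >= 2 matrices, built recursively on the (n-1)-variable mean."""
    X = list(mats)
    n = len(X)
    if n == 2:
        return weighted_mean(X[0], X[1], 0.5)
    for _ in range(max_iter):
        # One step of (71): move X_i towards the mean of the others with weight (n-1)/n.
        X = [weighted_mean(X[i],
                           bmp_mean(X[:i] + X[i + 1:], weighted_mean, tol, max_iter),
                           (n - 1) / n)
             for i in range(n)]
        if max(np.linalg.norm(Y - X[0]) for Y in X) < tol * np.linalg.norm(X[0]):
            break
    return X[0]
```

For commuting matrices a single step already lands on $(X_1 X_2 X_3)^{1/3}$, which gives a quick sanity check of the weights.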



Theorem 15. Let $F : [0, 1] \times P(r)^2 \to P(r)$ and suppose that $F_t(A, B)$ fulfills one of the inequalities below:

$$\left[(1-t)A^{-1} + tB^{-1}\right]^{-1} \le F_t(A, B) \le \left[(1-t)A^2 + tB^2 - \frac{k}{2}\, t(1-t)(A - B)^2\right]^{1/2} \tag{73}$$

for a $k \in (0, 2]$ and every $t \in [0, 1]$, or

$$F_t(A, B) \le (1-t)A + tB, \tag{74}$$

for every $t \in [0, 1]$. Then in Definition 6, starting with $M_t(A, B) := F_t(A, B)$, $M(X_1, \ldots, X_n)$ exists for all $n$, in other words the sequences converge to a common limit point for all $n$.
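It may help to see why $k = 2$ is the borderline value in (73): for $k = 2$ the expression under the square root is exactly the square of the weighted arithmetic mean, since $(1-t)A^2 + tB^2 - t(1-t)(A-B)^2 = ((1-t)A + tB)^2$. The sketch below checks this algebraic identity numerically; the function names are hypothetical and the random test matrices are only an illustration.

```python
import numpy as np


def rhs_squared(A, B, t, k):
    """Expression under the square root on the right hand side of (73)."""
    D = A - B
    return (1 - t) * (A @ A) + t * (B @ B) - (k / 2) * t * (1 - t) * (D @ D)


def weighted_arithmetic(A, B, t):
    """Weighted arithmetic mean (1 - t) A + t B."""
    return (1 - t) * A + t * B


def random_spd(r, rng):
    """Random symmetric positive definite r x r matrix (illustrative helper)."""
    M = rng.standard_normal((r, r))
    return M @ M.T + r * np.eye(r)
```

The identity holds for non-commuting $A$ and $B$ as well, because the cross terms $AB + BA$ appear symmetrically on both sides.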



Before we turn to the proof of the above theorem, we again consider some lemmas which will be similar to the ALM case. Let us recall again the metric space $P(r)$ with the distance function (29). We already know that the minimum of

$$h(x) = \sum_{i=1}^{n} d_f(x, x_i)^2 \tag{75}$$

is attained at a unique point in $P(r)$ denoted by $\bar{x}$, and we also know that

$$\bar{x} = f^{-1}\left(\frac{\sum_{i=1}^n f(x_i)}{n}\right). \tag{76}$$

We will need a theorem similar to Proposition 9.

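Proposition 16. Let $X_i^0 \in P(r)$ for $i = 1, \ldots, n$. Then the BMP-procedure (Definition 6) set up on the matrices $X_i^0$ with the weighted mean function $M_t(A, B) := f^{-1}\left((1-t)f(A) + tf(B)\right)$ and the $n-1$ variable function $M(x_1, \ldots, x_{n-1}) := f^{-1}\left(\frac{\sum_{i=1}^{n-1} f(x_i)}{n-1}\right)$ leaves the Riemann centroid of the points $X_1^l, \ldots, X_n^l$ invariant with respect to the distance function (29).

Proof.

$$\begin{aligned}
f^{-1}\left(\frac{\sum_{i=1}^n f(X_i^{l+1})}{n}\right)
&= f^{-1}\left(\frac{1}{n}\sum_{i=1}^n f\left(M_{\frac{n-1}{n}}\left(X_i^l, M\left(Z_{\neq i}(X_1^l, \ldots, X_n^l)\right)\right)\right)\right) \\
&= f^{-1}\left(\frac{1}{n}\sum_{i=1}^n \left[\frac{1}{n} f(X_i^l) + \frac{n-1}{n} \cdot \frac{\sum_{j=1, j \neq i}^n f(X_j^l)}{n-1}\right]\right) \\
&= f^{-1}\left(\frac{\sum_{i=1}^n f(X_i^l)}{n}\right).
\end{aligned} \tag{77}$$

Similarly we obtain the above equality for every iteration step, so

$$f^{-1}\left(\frac{\sum_{i=1}^n f(X_i^{l+1})}{n}\right) = f^{-1}\left(\frac{\sum_{i=1}^n f(X_i^{l})}{n}\right) = \cdots = f^{-1}\left(\frac{\sum_{i=1}^n f(X_i^{0})}{n}\right). \tag{78}$$

□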
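The invariance in Proposition 16 is easy to observe numerically for the choice $f(t) = t^{-1}$, which is the case used later for the harmonic mean. The sketch below is an illustration only; the names `harmonic`, `weighted_harmonic` and `bmp_step` are hypothetical.

```python
import numpy as np

inv = np.linalg.inv


def harmonic(mats):
    """n-variable harmonic mean f^{-1}(sum f(X_i) / n) with f(t) = 1/t."""
    return inv(sum(inv(X) for X in mats) / len(mats))


def weighted_harmonic(A, B, t):
    """M_t(A, B) = f^{-1}((1 - t) f(A) + t f(B)) with f(t) = 1/t."""
    return inv((1 - t) * inv(A) + t * inv(B))


def bmp_step(mats):
    """One BMP iteration step (71) built on the harmonic mean of the other n - 1 matrices."""
    n = len(mats)
    return [weighted_harmonic(mats[i], harmonic(mats[:i] + mats[i + 1:]), (n - 1) / n)
            for i in range(n)]
```

With this particular $f$ every updated matrix already equals the $n$-variable harmonic mean after one step, so the centroid is trivially preserved from then on.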



Proof (Theorem 15). The proof again will be based on induction on the number of matrices $n$. We will use the same notations to denote the sum of the squared distances of the matrices $X_i^l$ from the zero matrix and from one another with respect to the distance function (26), so

$$a_n^l = \sum_{i=1}^n d_E(0, X_i^l)^2 = \sum_{i=1}^n \operatorname{Tr}\{(X_i^l)^2\}, \tag{79}$$

$$e_n^l = \sum_{1 \le i < j \le n} d_E(X_i^l, X_j^l)^2 = \sum_{1 \le i < j \le n} \operatorname{Tr}\{(X_i^l - X_j^l)^2\}. \tag{80}$$

We will denote by $F(X_1^0, \ldots, X_n^0)$ the common limit point of the sequences $X_i^l$ for $n$. The proof will rely on three lemmas similar to the ones in the proof for the ALM-procedure. First we are going to treat the case of the first inequality (73).



Lemma 17 (Monotone Iteration). In the first case of inequality (73) we have

$$\left(\frac{\sum_{i=1}^n (X_i^{l+1})^{-1}}{n}\right)^{-1} \ge \left(\frac{\sum_{i=1}^n (X_i^{l})^{-1}}{n}\right)^{-1}. \tag{81}$$

In the second case of inequality (74) we have

$$\frac{\sum_{i=1}^n X_i^{l+1}}{n} \le \frac{\sum_{i=1}^n X_i^{l}}{n}. \tag{82}$$

Proof. The proof uses similar ideas to the case of Lemma Monotone Iteration for the ALM-process. We again argue by induction on the number of matrices $n$. Consider the first case of inequality (73). Suppose that the BMP-procedure converges for $n \ge 3$ to a common limit point $F(X_1^0, \ldots, X_n^0)$, in other words $X_i^l \to F(X_1^0, \ldots, X_n^0)$ for $n$. Also the inequality of the lemma for $n$ implies that

$$\left(\frac{\sum_{i=1}^n (X_i^{l})^{-1}}{n}\right)^{-1} \ge \left(\frac{\sum_{i=1}^n (X_i^{0})^{-1}}{n}\right)^{-1} \tag{83}$$

and if we take the limit on the left hand side for $n$ we get the inequality

$$F(X_1^0, \ldots, X_n^0) = \left(\frac{\sum_{i=1}^n F(X_1^0, \ldots, X_n^0)^{-1}}{n}\right)^{-1} \ge \left(\frac{\sum_{i=1}^n (X_i^{0})^{-1}}{n}\right)^{-1}. \tag{84}$$

The above inequality also holds for $n = 2$ by the assumption of inequality (73), so this provides the first step for $n = 3$ in our induction.

Now we prove the lemma for $n + 1$ if it is true for $n$. Similarly to the case of Lemma Monotone Iteration in the ALM process, we make use of the operator monotonicity of the $(n+1)$-variable harmonic mean, and use (84). Then we use Proposition 16 with $f(t) = t^{-1}$, and the same argument as in the ALM case, performed using one BMP iteration step instead, yields

$$\left(\frac{\sum_{i=1}^{n+1} (X_i^{l+1})^{-1}}{n+1}\right)^{-1} \ge \left(\frac{\sum_{i=1}^{n+1} (X_i^{l})^{-1}}{n+1}\right)^{-1}. \tag{85}$$

The second case given by inequality (74) again can be treated similarly to the ALM-case.

Lemma 18 (Decreasing Distances). We have

$$a_n^{l+1} \le a_n^l - \frac{k}{8} z_n e_n^l \tag{86}$$

in the case of (73), or we have (86) with $k = 2$ in the case of (74). In both cases

$$z_n = \frac{4}{(n-1)n}.$$

Proof. The situation is similar again to the ALM case. We will prove this for the case (73); the second case (74) is just the same with $k = 2$, since we will only use that the right hand side of (24) holds, so we do not have to treat the second case (74) separately due to (38). The first step is to show the above for $n = 3$. By Lemma 11 and the fact that the right hand side of (73) is equivalent to the assumption of the lemma, we get

$$\begin{aligned}
d_E\left(0, F_{2/3}\left(X_s^l, F(X_i^l, X_j^l)\right)\right)^2
&\le \frac{1}{3} d_E(0, X_s^l)^2 + \frac{2}{3} d_E\left(0, F(X_i^l, X_j^l)\right)^2 - \frac{k}{2} \cdot \frac{1}{3} \cdot \frac{2}{3}\, d_E\left(X_s^l, F(X_i^l, X_j^l)\right)^2 \\
&\le \frac{1}{3} d_E(0, X_s^l)^2 + \frac{2}{3} d_E\left(0, F(X_i^l, X_j^l)\right)^2 \\
&\le \frac{d_E(0, X_1^l)^2 + d_E(0, X_2^l)^2 + d_E(0, X_3^l)^2}{3} - \frac{k}{8} \cdot \frac{2}{3}\, d_E(X_i^l, X_j^l)^2,
\end{aligned} \tag{87}$$

in other words the last inequality is equivalent to

$$d_E(0, X_s^{l+1})^2 \le \frac{a_3^l}{3} - \frac{k}{12}\, d_E(X_i^l, X_j^l)^2, \tag{88}$$

where $s \in \{1, 2, 3\}$ and $i \neq j \neq s$, $s \neq i$. There are 3 distinct inequalities of the above form, one for each $s = 1, 2, 3$. By summing these inequalities over $s$ we get (86) for $n = 3$ and $z_3 = \frac{2}{3}$.

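Now suppose (86) holds for $n$ and that the sequences $X_i^l$ converge to a common limit point for $n$. Then we have

$$a_n^l \le \cdots \le a_n^1 \le a_n^0 - \frac{k}{8} z_n e_n^0 \tag{89}$$

and by taking the limit on the left hand side we get

$$\lim_{l \to \infty} a_n^l = n\, d_E\left(0, F(X_1^0, \ldots, X_n^0)\right)^2 \le a_n^0 - \frac{k}{8} z_n e_n^0,$$

$$d_E\left(0, F(X_1^0, \ldots, X_n^0)\right)^2 \le \frac{a_n^0}{n} - \frac{k}{8} \frac{z_n}{n} e_n^0. \tag{90}$$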



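Then set up the BMP-procedure on $X_i^0 \in P(r)$, $i = 1, 2, \ldots, n+1$, with $M_n(X_1, \ldots, X_n) := F(X_1, \ldots, X_n)$. Inequality (90) can be applied in any of the iteration steps, so we get

$$d_E\left(0, F\left(Z_{\neq i}(X_1^l, \ldots, X_{n+1}^l)\right)\right)^2 \le \frac{\sum_{j=1, j \neq i}^{n+1} d_E(0, X_j^l)^2}{n} - \frac{k}{8} \frac{z_n}{n} \sum_{\substack{1 \le j < s \le n+1 \\ j \neq i,\ s \neq i}} d_E(X_j^l, X_s^l)^2. \tag{91}$$

Then we have to compute $X_i^{l+1} = F_{\frac{n}{n+1}}\left(X_i^l, F\left(Z_{\neq i}(X_1^l, \ldots, X_{n+1}^l)\right)\right)$ and bound its squared distance from the zero matrix:

$$\begin{aligned}
d_E\left(0, F_{\frac{n}{n+1}}\left(X_i^l, F\left(Z_{\neq i}(X_1^l, \ldots, X_{n+1}^l)\right)\right)\right)^2
\le{}& \frac{1}{n+1} d_E(0, X_i^l)^2 + \frac{n}{n+1} d_E\left(0, F\left(Z_{\neq i}(X_1^l, \ldots, X_{n+1}^l)\right)\right)^2 \\
&- \frac{k}{2} \cdot \frac{1}{n+1} \cdot \frac{n}{n+1}\, d_E\left(X_i^l, F\left(Z_{\neq i}(X_1^l, \ldots, X_{n+1}^l)\right)\right)^2.
\end{aligned} \tag{92}$$

We drop the last term, as it seems hard to estimate it from below, and substituting inequality (91) we get

$$\begin{aligned}
d_E(0, X_i^{l+1})^2
&\le \frac{1}{n+1} d_E(0, X_i^l)^2 + \frac{n}{n+1} d_E\left(0, F\left(Z_{\neq i}(X_1^l, \ldots, X_{n+1}^l)\right)\right)^2 \\
&\le \frac{1}{n+1} d_E(0, X_i^l)^2 + \frac{n}{n+1}\left[\frac{\sum_{j=1, j \neq i}^{n+1} d_E(0, X_j^l)^2}{n} - \frac{k}{8} \frac{z_n}{n} \sum_{\substack{1 \le j < s \le n+1 \\ j \neq i,\ s \neq i}} d_E(X_j^l, X_s^l)^2\right] \\
&\le \frac{\sum_{j=1}^{n+1} d_E(0, X_j^l)^2}{n+1} - \frac{k}{8} \frac{z_n}{n+1} \sum_{\substack{1 \le j < s \le n+1 \\ j \neq i,\ s \neq i}} d_E(X_j^l, X_s^l)^2.
\end{aligned} \tag{93}$$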



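If we sum these inequalities over $i$ we arrive at the following

$$\sum_{i=1}^{n+1} d_E(0, X_i^{l+1})^2 \le \sum_{i=1}^{n+1} \left[\frac{\sum_{j=1}^{n+1} d_E(0, X_j^l)^2}{n+1} - \frac{k}{8} \frac{z_n}{n+1} \sum_{\substack{1 \le j < s \le n+1 \\ j \neq i,\ s \neq i}} d_E(X_j^l, X_s^l)^2\right] \tag{94}$$

which is equivalent to

$$\sum_{i=1}^{n+1} d_E(0, X_i^{l+1})^2 \le \sum_{i=1}^{n+1} d_E(0, X_i^{l})^2 - \frac{k}{8} \frac{z_n}{n+1} \sum_{i=1}^{n+1} \sum_{\substack{1 \le j < s \le n+1 \\ j \neq i,\ s \neq i}} d_E(X_j^l, X_s^l)^2. \tag{95}$$

The left hand side of the above is just $a_{n+1}^{l+1}$ and the first term on the right hand side is $a_{n+1}^l$. As in the proof of the convergence of the ALM-process, the second term satisfies

$$\sum_{i=1}^{n+1} \sum_{\substack{1 \le j < s \le n+1 \\ j \neq i,\ s \neq i}} d_E(X_j^l, X_s^l)^2 = (n-1)\, e_{n+1}^l. \tag{96}$$

Thus $z_{n+1} = \frac{n-1}{n+1} z_n$ and also $z_3 = \frac{2}{3}$; by solving the recursion we get

$$z_n = \frac{4}{(n-1)n}. \tag{97}$$

This concludes the lemma for every $n$.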
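The closed form for $z_n$ can be confirmed in exact rational arithmetic. This is a small sketch assuming the recursion $z_{n+1} = \frac{n-1}{n+1} z_n$ with starting value $z_3 = \frac{2}{3}$, as in the derivation above; the function name `z_bmp` is hypothetical.

```python
from fractions import Fraction


def z_bmp(n):
    """Iterate z_{m+1} = (m - 1)/(m + 1) * z_m from z_3 = 2/3 up to index n."""
    z = Fraction(2, 3)
    for m in range(3, n):
        z *= Fraction(m - 1, m + 1)
    return z


def z_closed_form(n):
    """Closed form 4 / ((n - 1) n) stated in Lemma 18."""
    return Fraction(4, (n - 1) * n)
```

The product telescopes, which is why the closed form is so simple: $z_n = \frac{2}{3} \prod_{m=3}^{n-1} \frac{m-1}{m+1}$.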

Lemma 19 (Boundedness). The matrix sequences $X_i^l$ are bounded for all $n$.

Proof. We have the trivial lower bound $X_i^l > 0$, since by assumption $F_t(A, B) > 0$, so we have $X_i^l > 0$ for $n = 3$. Now similarly to the case of the ALM-process we assume again that the BMP-procedure converges to a common limit point denoted by $F(X_1^0, \ldots, X_n^0)$ for $n$ and $F(X_1^0, \ldots, X_n^0) > 0$. Then again if the sequences converge for $n + 1$ to a common limit $F(X_1^0, \ldots, X_{n+1}^0)$, then this limit satisfies $F(X_1^0, \ldots, X_{n+1}^0) > 0$.

We provide the suitable upper bound similarly to the ALM case. We again have $a_{n+1}^{l} \le a_{n+1}^0 = b$, so we similarly get $\|X_i^l\|^2 \le b$ for $n + 1$ if the procedure converges to a common limit for $n$. This finishes the proof of the lemma.

Now the last step of the proof is exactly the same as in the case of the ALM-procedure, we just have to use the three lemmas: Lemma Monotone Iteration, Decreasing Distances and Boundedness adapted for the case of the BMP iteration. □

Notice that for matrix means we have the weighted mean procedure $M_t(A, B)$ introduced here. By Proposition 2 we have that every such mean is smaller than the weighted arithmetic mean and larger than the weighted harmonic mean. So as a consequence of Theorem 15 we get that the BMP-procedure converges for every symmetric matrix mean if we identify their weighted counterparts with our weighted mean $M_t(A, B)$.

6. Properties of the ALM and BMP mean 

We will show that the limit points of the ALM and BMP processes, denoted by $M_{ALM}(X_1, \ldots, X_n)$ and $M_{BMP}(X_1, \ldots, X_n)$ respectively, as extensions of symmetric matrix means, fulfill the following properties.

Theorem 20. If $M(A, B)$ is a symmetric matrix mean, then the $M := M_{ALM}(X_1, \ldots, X_n)$ and $M := M_{BMP}(X_1, \ldots, X_n)$ extensions fulfill the following properties

(I) $M(X, \ldots, X) = X$ for every $X \in P(r)$,

(II) $M(X_1, \ldots, X_n)$ is invariant under the permutation of its variables,

(III) $\min(X_1, \ldots, X_n) \le M(X_1, \ldots, X_n) \le \max(X_1, \ldots, X_n)$ if min and max exist with respect to the positive definite order,

(IV) If $X_i \le X_i'$, then $M(X_1, \ldots, X_n) \le M(X_1', \ldots, X_n')$,

(V) $M(X_1, \ldots, X_n)$ is continuous,

(VI) $M(CX_1C^*, \ldots, CX_nC^*) = CM(X_1, \ldots, X_n)C^*$ for all invertible $C$.
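Properties (II) and (VI) can be observed numerically on a concrete instance. The sketch below is only an illustration: it takes the geometric mean as the symmetric two-variable mean, uses the ALM update $X_i^{l+1} = M(Z_{\neq i}(X_1^l, \ldots, X_n^l))$, and the names `apply_fn`, `geometric`, `alm_mean`, the tolerance and the iteration cap are hypothetical choices.

```python
import numpy as np


def apply_fn(A, fn):
    """Apply a scalar function to a symmetric matrix via its eigendecomposition."""
    w, V = np.linalg.eigh(A)
    return (V * fn(w)) @ V.T


def geometric(A, B):
    """Two-variable geometric mean A^{1/2} (A^{-1/2} B A^{-1/2})^{1/2} A^{1/2}."""
    A_half = apply_fn(A, np.sqrt)
    A_inv_half = apply_fn(A, lambda w: 1.0 / np.sqrt(w))
    C = A_inv_half @ B @ A_inv_half
    C = (C + C.T) / 2  # remove rounding asymmetry before calling eigh
    return A_half @ apply_fn(C, np.sqrt) @ A_half


def alm_mean(mats, tol=1e-11, max_iter=200):
    """ALM limit point for n >= 2 matrices, built recursively on the (n-1)-variable mean."""
    X = list(mats)
    n = len(X)
    if n == 2:
        return geometric(X[0], X[1])
    for _ in range(max_iter):
        # Replace every X_i by the (n-1)-variable mean of the remaining matrices.
        X = [alm_mean(X[:i] + X[i + 1:], tol, max_iter) for i in range(n)]
        if max(np.linalg.norm(Y - X[0]) for Y in X) < tol * np.linalg.norm(X[0]):
            break
    return X[0]
```

Comparing `alm_mean` on a permuted tuple, or on the congruence-transformed tuple $CX_iC^*$, against the transformed result gives agreement up to the stopping tolerance.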

Proof. The proof of each property will be based on induction. Each of them trivially holds for $n = 2$ by the properties of matrix means discussed in the second section. So it remains to prove them for $n + 1$ assuming that they hold for $n$.

Properties (I) and (II) trivially hold for $n + 1$ if they hold for $n$. We prove property (IV). Let $X_1^0, \ldots, X_{n+1}^0 \in P(r)$ and $X_i^0 \le (X_i')^0 \in P(r)$. If we iterate by the ALM process, it is easy to see that the order $X_i^0 \le (X_i')^0$ is preserved due to the inductional hypothesis on property (IV), so $X_i^l \le (X_i')^l$. Taking the limits $l \to \infty$ we get the assertion. In the case of the BMP-process the argument is similar, but we also have to use that $M_t(A, B) \le M_t(A', B')$ if $A \le A'$ and $B \le B'$.

Property (III) is an easy consequence of properties (I) and (IV), if the minimum and maximum exist. Setting up the same iteration on the new tuple formed by repeating the minimal element we get the inequality on the left in property (III); similarly we can obtain the inequality on the right as well.

To prove property (VI) let $(X_i')^0 = CX_i^0C^*$ and set up the ALM or BMP process on $(X_1')^0, \ldots, (X_{n+1}')^0$ as on $X_1^0, \ldots, X_{n+1}^0$. Property (VI) for $n$ implies in the case of ALM

$$CX_i^{l+1}C^* = CM\left(Z_{\neq i}(X_1^l, \ldots, X_{n+1}^l)\right)C^* = M\left(Z_{\neq i}(CX_1^lC^*, \ldots, CX_{n+1}^lC^*)\right). \tag{98}$$

In the case of the BMP process we have similarly

$$CX_i^{l+1}C^* = CM_{\frac{n}{n+1}}\left(X_i^l, M\left(Z_{\neq i}(X_1^l, \ldots, X_{n+1}^l)\right)\right)C^* = M_{\frac{n}{n+1}}\left(CX_i^lC^*, M\left(Z_{\neq i}(CX_1^lC^*, \ldots, CX_{n+1}^lC^*)\right)\right). \tag{99}$$

Applying the above recursively in every iteration step we get

$$CX_i^lC^* = (X_i')^l. \tag{100}$$

Taking the limit $l \to \infty$ the assertion follows.

Property (V) is a consequence of properties (IV) and (VI) by Lemma |H 

□ 

We also have that the ALM and BMP procedures preserve the ordering of functions. So we have for the ALM process the following

Proposition 21. If $M(A, B) \le N(A, B)$ are functions satisfying the properties of $F(A, B)$ in Theorem 7, then the same ordering is true for the ALM limit points $M(X_1, \ldots, X_n)$ and $N(X_1, \ldots, X_n)$.

Proof. Again we argue by induction. The inequality $M(X_1, \ldots, X_n) \le N(X_1, \ldots, X_n)$ holds for $n = 2$ by assumption. Let us denote the matrices in the ALM iteration steps performed with $M(X_1, \ldots, X_{n-1})$ and $N(X_1, \ldots, X_{n-1})$ on $X_1^0, \ldots, X_n^0 \in P(r)$ by $X_i^l$ and $(X_i^l)'$ respectively. Now again we have $M(X_1, \ldots, X_{n-1}) \le N(X_1, \ldots, X_{n-1})$ by the inductional hypothesis, so we have $X_i^l \le (X_i^l)'$. Taking the limits we get the assertion. □

A similar, although slightly different, assertion holds for the BMP process.

Proposition 22. If $M_t(A, B) \le N_t(A, B)$ are functions satisfying the properties of $F_t(A, B)$ in Theorem 15, then the same ordering is true for the BMP limit points $M(X_1, \ldots, X_n)$ and $N(X_1, \ldots, X_n)$.

Proof. We have an inductional argument similar to the preceding case of the ALM process. The inequality $M(A, B) \le N(A, B)$ holds for $n = 2$ by assumption since $M(A, B) = M_{1/2}(A, B)$ and $N(A, B) = N_{1/2}(A, B)$. Let us denote the matrices in the BMP iteration steps performed with $M(X_1, \ldots, X_{n-1})$ and $N(X_1, \ldots, X_{n-1})$ on $X_1^0, \ldots, X_n^0 \in P(r)$ by $X_i^l$ and $(X_i^l)'$ respectively. Now again we have $M(X_1, \ldots, X_{n-1}) \le N(X_1, \ldots, X_{n-1})$ by the inductional hypothesis and also $M_t(A, B) \le N_t(A, B)$, so we have $X_i^l \le (X_i^l)'$. Taking the limits we get the assertion. □

In the next section we will consider some convergence rate properties 
fulfilled by the BMP process. 

7. Convergence rate of the BMP process 

A well known property of the BMP process is its cubic convergence rate in a small neighborhood of its limit point. This is an advantage over the ALM process, which is known to converge linearly. To be more specific, these convergence rates are only precisely known in the case of the geometric mean [1, 2]. In this section we will show that the BMP process generally converges cubically for every symmetric matrix mean in a small neighborhood of the limit point of the process. The proof will be similar to the one presented in [2]. In order to be able to use such an argument we have to obtain a series expansion for the weighted mean $M_t(A, B)$ in the neighborhood of the identity matrix $I$.

We are going to use the big O notation as it is used in [2]. This means that we have $X = Y + O(\varepsilon^k)$ if and only if there exist constants $\varepsilon_0 < 1$ and $\theta$ such that for each $0 < \varepsilon < \varepsilon_0$ we have $\|X - Y\| \le \theta \varepsilon^k$.

Proposition 23. Let M{A,B) be a symmetric matrix mean and f{t) be its 
corresponding normalized operator monotone function. Let f{t) have a series 
expansion around I as 





(101) 



k=2 
Then we have a series expansion for $M_t(I, X) = f_t(X)$ whenever $\|X - I\| < \varepsilon < 1$ in the form

$$f_t(X) = I + t(X - I) - 4b_2 t(1-t)(X - I)^2 + O(\varepsilon^3). \tag{102}$$

Proof. Since $M_t(A, B)$ is a matrix mean if the generating $M(A, B)$ is a symmetric matrix mean, it has the following representation

$$M_t(A, B) = A^{1/2} f_t\left(A^{-1/2} B A^{-1/2}\right) A^{1/2} = A f_t\left(A^{-1} B\right), \tag{103}$$

where $f_t(X)$ is a normalized operator monotone function in $X$, therefore analytic on $(0, \infty)$, hence we have the second equality as well. Since it is generally analytic only on $(0, \infty)$, we expect (101) to be convergent only for $\|X - I\| < 1$. By the above representation for $M_t(A, B)$ and the fact that $M_{1/2}(A, B) = M(A, B)$ by definition, it is enough to show that the expansion in the assertion holds for $f_t(X)$. In other words we have to consider the mean $M_t(A_0, B_0)$ of $A_0 = I$ and an arbitrary $B_0 = X$. We also have a natural expansion in the neighborhood of $I$ for the inverse function as

$$X^{-1} = \sum_{k=0}^{\infty} (-1)^k (X - I)^k, \tag{104}$$

which is convergent for $\|X - I\| < 1$. Now in every step of the Weighted mean process we have to compute a symmetric mean of two matrices, and by the assumption of the assertion we have $\|X - I\| < \varepsilon < 1$. Without loss of generality we may write $A_j$ and $B_j$ in the following forms

$$\begin{aligned}
A_j &= I + y_1^j (X - I) + y_2^j (X - I)^2 + O(\varepsilon^3), \\
B_j &= I + z_1^j (X - I) + z_2^j (X - I)^2 + O(\varepsilon^3).
\end{aligned} \tag{105}$$
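Now we will make use of the above expansions to get an expansion for $M(A_j, B_j)$ up to the $O(\varepsilon^3)$ term as follows

$$\begin{aligned}
M(A_j, B_j) = A_j f(A_j^{-1} B_j) ={}& \left(I + y_1^j (X - I) + y_2^j (X - I)^2 + O(\varepsilon^3)\right) f\left[A_j^{-1}\left(I + z_1^j (X - I) + z_2^j (X - I)^2 + O(\varepsilon^3)\right)\right] \\
={}& \left(I + y_1^j (X - I) + y_2^j (X - I)^2 + O(\varepsilon^3)\right) f\Big[\left(I - y_1^j (X - I) + \left((y_1^j)^2 - y_2^j\right)(X - I)^2 + O(\varepsilon^3)\right) \\
& \cdot \left(I + z_1^j (X - I) + z_2^j (X - I)^2 + O(\varepsilon^3)\right)\Big],
\end{aligned} \tag{106}$$

where we have used (104) to express $A_j^{-1}$ and (101) to express $f(X)$ up to $O(\varepsilon^3)$ terms. After some calculation, and taking into account that the terms $(X - I)^k$ with $k \ge 3$ are of $O(\varepsilon^3)$, we get that

$$M(A_j, B_j) = I + \frac{y_1^j + z_1^j}{2}(X - I) + \left[\frac{y_2^j + z_2^j}{2} - b_2\left(y_1^j - z_1^j\right)^2\right](X - I)^2 + O(\varepsilon^3). \tag{107}$$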

Note that since $A_0 = I$ and $B_0 = X$ we have $y_2^0 = z_2^0 = 0$. Hence it can be easily proved by induction that the terms

$$y_2^j = -p_j b_2, \qquad z_2^j = -q_j b_2, \tag{108}$$

where $p_j$ and $q_j$ are functions which do not depend on $b_2$. Also since $f_t(X)$ is an analytic function due to Kubo-Ando theory, the limits $p = \lim_{j \to \infty} p_j = \lim_{j \to \infty} q_j$ exist, and this limit function is also independent of $b_2$. Now since the weighted geometric mean

$$G_t(A, B) = A^{1/2}\left(A^{-1/2} B A^{-1/2}\right)^t A^{1/2} = A\left(A^{-1} B\right)^t \tag{109}$$

is an affine mean, the Weighted mean process gives back $G_t(A, B)$ for every $t \in [0, 1]$ and $A, B \in P(r)$. In other words, in this case if we expand the function $X^t$ into a Taylor series around $I$ we get

$$G_t(I, X) = X^t = I + t(X - I) - \frac{t(1-t)}{2}(X - I)^2 + O(\varepsilon^3) \tag{110}$$

and this equation for $t = 1/2$ gives that $b_2 = 1/8$. Since $G_t(A, B)$ is an affine mean and $p$ does not explicitly depend on $b_2$, we get $p = 4t(1-t)$. A similar consideration can be applied in the case of the linear term $t(X - I)$. □
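The second order expansion of $X^t$ around $I$ used in (110) can be checked numerically: the remainder should shrink like $\varepsilon^3$ as $\|X - I\| = \varepsilon \to 0$. The following sketch is an illustration only; the helper names and the random symmetric perturbation are hypothetical choices.

```python
import numpy as np


def mat_power(X, t):
    """X^t for a symmetric positive definite matrix via its eigendecomposition."""
    w, V = np.linalg.eigh(X)
    return (V * w ** t) @ V.T


def second_order(X, t):
    """I + t (X - I) - t (1 - t) / 2 (X - I)^2, the truncation in (110)."""
    I = np.eye(len(X))
    D = X - I
    return I + t * D - 0.5 * t * (1 - t) * (D @ D)


def expansion_error(eps, t, r=4, seed=0):
    """Spectral norm of X^t minus its second order truncation for ||X - I|| = eps."""
    rng = np.random.default_rng(seed)
    S = rng.standard_normal((r, r))
    S = (S + S.T) / 2
    S /= np.linalg.norm(S, 2)  # normalize so that ||S|| = 1
    X = np.eye(r) + eps * S
    return np.linalg.norm(mat_power(X, t) - second_order(X, t), 2)
```

Shrinking $\varepsilon$ by a factor of ten should shrink the error by roughly a factor of a thousand, consistent with an $O(\varepsilon^3)$ remainder.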

The above proposition tells us that no matter how we choose the symmetric matrix mean $M(A, B)$, the series expansion of $M_t(A, B)$ around $I$ will have a similar structure up to the $(X - I)^2$ term:

$$M_t(A, B) = (1-t)A + tB - 4b_2 t(1-t) A^{1/2}\left(A^{-1/2} B A^{-1/2} - I\right)^2 A^{1/2} + \ldots \tag{111}$$

Actually $0 \le b_2$ for every matrix mean since the corresponding operator monotone function is concave. It is not hard to prove using Proposition 6 that $0 \le b_2 \le 1/2$ for every matrix mean.

A remarkable consequence of the above series expansion is the following

Theorem 24. The BMP procedure converges at least cubically if the matrices are sufficiently close to each other for all symmetric matrix means.

Proof. This is so because we can mimic the proof of Theorem 3.2 in [2], which ensures the cubic convergence of the BMP process for the geometric mean. The only differences we need to take care of are in the series expansions. Namely, in property C1 of Theorem 3.2 in [2] we will get

$$T_n = \frac{1}{n}\sum_{j=1}^n E_j - \frac{2b_2}{n^2}\sum_{i,j=1}^n (E_i - E_j)^2. \tag{112}$$

The other parts of the proof are just the same. □